182 research outputs found

    The development of computational biology in South Africa: successes achieved and lessons learnt

    Get PDF
    Bioinformatics is now a critical skill in many research and commercial environments as biological data are increasing in both size and complexity. South African researchers recognized this need in the mid-1990s and responded by working with the government as well as international bodies to develop initiatives to build bioinformatics capacity in the country. Significant injections of support from these bodies provided a springboard for the establishment of computational biology units at multiple universities throughout the country, which took on teaching, basic research and support roles. Several challenges were encountered, for example with unreliability of funding, lack of skills, and lack of infrastructure. However, the bioinformatics community worked together to overcome these, and South Africa is now arguably the leading country in bioinformatics on the African continent. Here we discuss how the discipline developed in the country, highlighting the challenges, successes, and lessons learnt

    Genome-wide binding of the CRISPR endonuclease Cas9 in mammalian cells

    Get PDF
    Bacterial type II CRISPR-Cas9 systems have been widely adapted for RNA-guided genome editing and transcription regulation in eukaryotic cells, yet their in vivo target specificity is poorly understood. Here we mapped genome-wide binding sites of a catalytically inactive Cas9 (dCas9) from Streptococcus pyogenes loaded with single guide RNAs (sgRNAs) in mouse embryonic stem cells (mESCs). Each of the four sgRNAs we tested targets dCas9 to between tens and thousands of genomic sites, frequently characterized by a 5-nucleotide seed region in the sgRNA and an NGG protospacer adjacent motif (PAM). Chromatin inaccessibility decreases dCas9 binding to other sites with matching seed sequences; thus 70% of off-target sites are associated with genes. Targeted sequencing of 295 dCas9 binding sites in mESCs transfected with catalytically active Cas9 identified only one site mutated above background levels. We propose a two-state model for Cas9 binding and cleavage, in which a seed match triggers binding but extensive pairing with target DNA is required for cleavage.National Institutes of Health (U.S.) (Grant RO1-GM34277)National Institutes of Health (U.S.) (Grant R01-CA133404)National Cancer Institute (U.S.) (Grant PO1-CA42063)National Cancer Institute (U.S.) (Cancer Center Support (Core) Grant P30-CA14051)National Institutes of Health (U.S.) (Director's Pioneer Award 1DP1-MH100706)Damon Runyon Cancer Research FoundationKinship Foundation. Searle Scholars ProgramSimons Foundatio

    G-quadruplex structures mark human regulatory chromatin

    Get PDF
    G-quadruplex (G4) structural motifs have been linked to transcription, replication and genome instability and are implicated in cancer and other diseases. However, it is crucial to demonstrate the bona fide formation of G4 structures within an endogenous chromatin context. Herein we address this through the development of G4 ChIP-seq, an antibody-based G4 chromatin immunoprecipitation and high-throughput sequencing approach. We find ∼10,000 G4 structures in human chromatin, predominantly in regulatory, nucleosome-depleted regions. G4 structures are enriched in the promoters and 5' UTRs of highly transcribed genes, particularly in genes related to cancer and in somatic copy number amplifications, such as MYC\textit{MYC}. Strikingly, de novo\textit{de novo} and enhanced G4 formation are associated with increased transcriptional activity, as shown by HDAC inhibitor-induced chromatin relaxation and observed in immortalized as compared to normal cellular states. Our findings show that regulatory, nucleosome-depleted chromatin and elevated transcription shape the endogenous human G4 DNA landscape.European Molecular Biology Organization (EMBO Long-Term Fellowship), University of Cambridge, Cancer Research UK (Grant ID: C14303/A17197), Wellcome Trust (Grant ID: 099232/z/12/z

    Lung function associated gene Integrator Complex subunit 12 regulates protein synthesis pathways

    Get PDF
    Background: Genetic studies of human lung function and Chronic Obstructive Pulmonary Disease have identified a highly significant and reproducible signal on 4q24. It remains unclear which of the two candidate genes within this locus may regulate lung function: GSTCD, a gene with unknown function, and/or INTS12, a member of the Integrator Complex which is currently thought to mediate 3'end processing of small nuclear RNAs.Results: We found that, in lung tissue, 4q24 polymorphisms associated with lung function correlate with INTS12 but not neighbouring GSTCD expression. In contrast to the previous reports in other species, we only observed a minor alteration of snRNA processing following INTS12 depletion. RNAseq analysis of knockdown cells instead revealed dysregulation of a core subset of genes relevant to airway biology and a robust downregulation of protein synthesis pathways. Consistent with this, protein translation was decreased in INTS12 knockdown cells. In addition, ChIPseq experiments demonstrated INTS12 binding throughout the genome, which was enriched in transcriptionally active regions. Finally, we defined the INTS12 regulome which includes genes belonging to the protein synthesis pathways.Conclusion: INTS12 has functions beyond the canonical snRNA processing. We show that it regulates translation by regulating the expression of genes belonging to protein synthesis pathways. This study provides a detailed analysis of INTS12 activities on a genome-wide scale and contributes to the biology behind the genetic association for lung function at 4q24.</p

    FOXM1 binds directly to non-consensus sequences in the human genome.

    Get PDF
    BACKGROUND: The Forkhead (FKH) transcription factor FOXM1 is a key regulator of the cell cycle and is overexpressed in most types of cancer. FOXM1, similar to other FKH factors, binds to a canonical FKH motif in vitro. However, genome-wide mapping studies in different cell lines have shown a lack of enrichment of the FKH motif, suggesting an alternative mode of chromatin recruitment. We have investigated the role of direct versus indirect DNA binding in FOXM1 recruitment by performing ChIP-seq with wild-type and DNA binding deficient FOXM1. RESULTS: An in vitro fluorescence polarization assay identified point mutations in the DNA binding domain of FOXM1 that inhibit binding to a FKH consensus sequence. Cell lines expressing either wild-type or DNA binding deficient GFP-tagged FOXM1 were used for genome-wide mapping studies comparing the distribution of the DNA binding deficient protein to the wild-type. This shows that interaction of the FOXM1 DNA binding domain with target DNA is essential for recruitment. Moreover, analysis of the protein interactome of wild-type versus DNA binding deficient FOXM1 shows that the reduced recruitment is not due to inhibition of protein-protein interactions. CONCLUSIONS: A functional DNA binding domain is essential for FOXM1 chromatin recruitment. Even in FOXM1 mutants with almost complete loss of binding, the protein-protein interactions and pattern of phosphorylation are largely unaffected. These results strongly support a model whereby FOXM1 is specifically recruited to chromatin through co-factor interactions by binding directly to non-canonical DNA sequences.We would like to acknowledge the Genomics and bioinformatics core at the CRUK Research Institute for the Illumina sequencing and the Proteomics core for the LC/MS-MS protein analysis for the RIME experiments. We acknowledge the support from The University of Cambridge and Cancer Research UK. The Balasubramanian Laboratory is supported by core funding from Cancer Research UK (C14303/A17197). SB is a Wellcome Trust Principle Investigator.This is the final version of the article. It first appeared from BioMed Central via http://dx.doi.org/10.1186/s13059-015-0696-

    Occupancy maps of 208 chromatin-associated proteins in one human cell type

    Get PDF
    Transcription factors are DNA-binding proteins that have key roles in gene regulation. Genome-wide occupancy maps of transcriptional regulators are important for understanding gene regulation and its effects on diverse biological processes. However, only a minority of the more than 1,600 transcription factors encoded in the human genome has been assayed. Here we present, as part of the ENCODE (Encyclopedia of DNA Elements) project, data and analyses from chromatin immunoprecipitation followed by high-throughput sequencing (ChIP–seq) experiments using the human HepG2 cell line for 208 chromatin-associated proteins (CAPs). These comprise 171 transcription factors and 37 transcriptional cofactors and chromatin regulator proteins, and represent nearly one-quarter of CAPs expressed in HepG2 cells. The binding profiles of these CAPs form major groups associated predominantly with promoters or enhancers, or with both. We confirm and expand the current catalogue of DNA sequence motifs for transcription factors, and describe motifs that correspond to other transcription factors that are co-enriched with the primary ChIP target. For example, FOX family motifs are enriched in ChIP–seq peaks of 37 other CAPs. We show that motif content and occupancy patterns can distinguish between promoters and enhancers. This catalogue reveals high-occupancy target regions at which many CAPs associate, although each contains motifs for only a minority of the numerous associated transcription factors. These analyses provide a more complete overview of the gene regulatory networks that define this cell type, and demonstrate the usefulness of the large-scale production efforts of the ENCODE Consortium
    • …
    corecore